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The megakaryoblastic leukemia 1 (MKLl) protein functions as a transcriptional coactivator of the serum 
response factor. MKLl has three RPEL motifs (RPELl, RPEL2, and RPEL3) in its N-terminal region. MKLl 
binds to monomeric G-actin through RPEL motifs, and the dissociation of MKLl from G-actin promotes 
the translocation of MKLl to the nucleus. Although structural data are available for RPEL motifs of MKLl in 
complex with G-actin, the structural characteristics of RPEL motifs in the free state have been poorly 
defined. Here we characterized the structures of free RPEL motifs using NMR and CD spectroscopy. NMR 
and CD measurements showed that free RPEL motifs are largely unstructured in solution. However, NMR 
analysis identified transient a-helices in the regions where helices al and a2 are induced upon binding to 
G-actin. Proline mutagenesis showed that the transient a-helices are locally formed without helix-helix 
interactions. The helix content is higher in the order of RPELl, RPEL2, and RPEL3. The amount of 
preformed structure may correlate with the binding affinity between the intrinsically disordered protein and 
its target molecule. 

The megakaryoblastic leukemia 1 (MKLl) protein was originally identified in a study of chromosomal 
translocation in infants and children with acute megakaryocytic leukemia^'^. It has been implicated in cancer 
cell migration and invasion^'^ as well as in the regulation of neurite outgrowth^"^ and dendritic complex- 
ity^-io MKLl is a member of the myocardin-related transcription factor family, which regulates essential bio- 
logical processes, including the development and differentiation of cells. MKLl functions as a transcriptional 
coactivator of the serum response factor (SRF) in the cell nucleus. It also functions as a G-actin-binding protein. 
The localization of MKLl in cells is regulated by the monomeric G-actin level in the cytoplasm — namely, actin 
polymerization. Actin polymerization is induced by the activation of RhoA signaling, which decreases the 
monomeric G-actin level in the cytoplasm. The depletion of G-actin in the cytoplasm results in the dissociation 
of MKLl from G-actin. As a result, MKLl translocates from the cytoplasm to the nucleus through the importin a/ 
Pl heterodimer^\ In the nucleus, MKLl collaborates with SRF to induce the transcription of a number of genes, 
including actin, c-fos, and SRF itself 

Rat MKLl (GenBank accession number: BAN82605.1) is a 1038 amino acid protein that has an N-terminal 
actin-binding RPEL domain, basic boxes, a glutamine-rich domain, an SAP domain, a leucine zipper-like domain, 
and a transactivation domain^^'^^. The RPEL domain consists of three RPEL motifs, each of which functions as an 
actin-binding element^^. RPEL2 and RPEL3 each have a core sequence of RPxxxEL, while RPELl has a non- 
canonical RRxxxEL core sequence (Figure la). The crystal structures of RPELl and RPEL2 in complex with G- 
actin have been reported^^. In these complexes, the RPEL motif adopts two a-helices (helices al and a2) and binds 
to the hydrophobic cleft and hydrophobic ledge of G-actin (Figure lb). The complex structure also suggests that 
side chains of L136, K139, 1140, R143, L149, 1154, and L155 of RPEL2 are essential for the interaction with G-actin 
(Figure Ic)'^ 

Although the structure of the RPEL motif in complex with G-actin has been reported^°, little is known about the 
conformational properties of RPEL motifs in the free state. This information is necessary to fully understand the 
translocation of MKLl from the cytoplasm to the nucleus, since the translocation requires the dissociation of 
MKLl from G-actin. In this study, the structure of the free RPEL motif is investigated by nuclear magnetic 
resonance (NMR) and circular dichroism (CD) spectroscopy. 
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Figure 1 | (a) Amino acid sequences of RPEL motifs from rat MKLl. The positions of helices al and a2 of RPELl in complex with G-actin are indicated. 
The sequences of RPxxxEL and RRxxxEL are underlined. Numbering is based on GenBank accession number BAN82605. 1 . Asterisks indicate the essential 
residues for actin-binding. (b) Three-dimensional structure of RPEL2 bound to G-actin (PDB entry 2V52). The positions of helices al and a2 are 
indicated, (c) The interaction region between RPEL2 and G-actin (PDB entry 2V52). G-actin is shown in surface and cartoon representations. RPEL2 is 
shown in stick representations and essential residues for actin-binding are shown in red. The side chain of K139 in RPEL2 adopts two conformations, both 
interacting with G-actin^°. (d) Prediction of disorder tendency of full-length MKLl by the lUPred predictor^^. Scores above a threshold value of 0.5 are 
considered to be disordered. The positions of RPELl, RPEL2, and RPEL3 are indicated. 



Results 

CD analysis of RPELs. We analyzed the secondary structure of 
RPELs using CD spectroscopy. The CD spectra of RPELs exhi- 
bited a strong negative band near 200 nm and a weak negative 
shoulder at 220 nm, which is characteristic of unfolded polypep- 
tides^^ (Figure 2). This is supported by the prediction of structural 
disorder by lUPred'^ which indicates that RPELl, RPEL2, and 
RPEL3 are disordered (Figure Id). The negative band near 200 nm 
is stronger in the order of RPELS > RPEL2 > RPELl (Figure 2). 
Although free RPEL motifs are largely unstructured, analysis of the 
helix content shows that RPEL adopts a small but significant amount 
of ot-helix conformation. The helix content is higher in the order of 
RPELl > RPEL2 > RPELS (Table 1). 

Chemical shift analysis of RPELl. The conformational properties of 
RPELl in the free state were investigated by NMR spectroscopy 
(Figure S). NMR spectra were measured at 15°C to reduce hydro- 
gen exchange. Backbone resonance assignments were obtained with 
standard triple-resonance NMR experiments. The ^H-^^N HSQC 
spectrum of RPELl shows the backbone amide resonances within 
8.0-8.6 ppm in the dimension (Figure S). The narrow chemical 
shift dispersion in the dimension is characteristic of intrinsically 
disordered proteins (IDPs)^^'^^. 

NMR chemical shifts can be used to probe the propensity of pro- 
teins to adopt an a-hehcal conformation and extended (p-strand) 
conformation at the residue level. In order to investigate the 



conformational properties of RPEL, we utilized the secondary struc- 
ture propensity (SSP) program^^. The SSP program combines chem- 
ical shifts from different nuclei into a single score indicating the 
secondary structure propensity. A positive SSP score indicates a 
propensity for ot-helical conformation, and a negative score indicates 
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Figure 2 | CD spectra of RPELl (solid line), RPEL2 (dotted line), and 
RPELS (dashed line) at pH 7.0 and 25°C in the far-UV region. 
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Table 1 


Helix content of the RPEL motif calculated from far-UV CD spectra 






Helix content (%) 






the self-consistent method^^ the CONTIN method^^ 


the variable-selection method^^ 


RPELl 
RPEL2 
RPEL3 


16 13 
14 1 1 
6.7 5.0 


7.1 
5.1 

0.7 



a propensity for extended (p-strand) conformation (Figure 4a). The 
SSP scores represent the expected fraction of a-hehcal or extended 
(p-strand) conformation at a given residue^^ For instance, an SSP 
score of 0.5 indicates that 50% of the conformers in the disordered 
state ensemble adopt an ot-hehcal conformation at that particular 
position. In order to examine the positions of helices, we utilized 
the difference between ^^C"' secondary shifts and ^^C"^ secondary shifts 
{AbC^ - A5CP) (Figure 4b). Consecutive positive values of ASC^ - 
ASC*^ indicate a propensity to adopt an ot-helical structure, and con- 
secutive negative values indicate an extended (p-strand) structure^l 
Since appropriate random coil chemical shifts are important to 
obtain reliable secondary structure propensities, we used the random 
coil chemical shifts suitable for IDPs^^. 

The residues from Asn90 to Gln97 and from Argl02 to Serl07 
have a propensity to adopt an a-helical conformation, indicating that 
helices al and a2 are formed in free RPELl (Figure 4b). This does not 
mean, however, that the helix al and a2 are stably formed in free 
RPELl, since the SSP scores are below 0.5 (Figure 4a). The averaged 
SSP score is —19% for the helix al region (residues 90-97) and 
—26% for the helix a2 region (residues 102-108). Therefore, helices 
al and a2 are transiently formed in free RPELl. 

Proline mutagenesis of RPELl. We confirmed the transient a-helix 
formation of RPELl using proline mutagenesis. Proline mutation 
unfolds or greatly destabilizes the protein structure when inserted 
in the middle of secondary structures^^'^^. Mutation of Leu94 to Pro 
reduced the a-helical propensity in the helix al region, indicating the 
helix formation. However, the mutation of Leu94 to Pro had little 
effect on the helical propensity in the a2 region (Figure 5a). In 
addition, the mutation of Leul05 to Pro reduced the a-helical 
propensity in the a2 region of RPELl but had little effect on the 
helical propensity in the al region (Figure 5b). These results indi- 
cate that the transient a-helices in free RPELl are independently 
formed without helix-helix interactions. 
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Figure 3 | The ^H-^'N HSQC spectrum of RPELl at pH 7.0 and 15°C. The 

backbone resonance assignments are indicated. Numbering is based on 
GenBank accession number BAN82605.1. 



Chemical shift analysis of RPEL2 and RPEL3. We also investigated 
the conformational propensity of RPEL2 and RPEL3 using NMR 
spectroscopy (Figure 6). The residues from Thrl32 to Lysl39 and 
from Leu 149 to Met 152 of RPEL2 have a propensity to adopt an a- 
helical conformation (Figure 6a). On the other hand, RPEL3 exhibits 
no significant helical propensity (Figure 6b). Together, our results 
suggest that the helices al and a2 are transiently formed in RPELl 
and RPEL2, while the helix is not formed in RPEL3. The helix content 
is higher in the order of RPELl > RPEL2 > RPEL3 (Figures 4 and 6). 

Discussion 

Many IDPs undergo a disorder-to-order transition upon binding to 
their target molecule, a process that has been called "coupled folding 
and binding^^." There are two models describing the mechanism 
of the disorder-to-order transition — namely, the conformational 
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Figure 4 | (a) SSP scores of RPELl. (b) ASC"" - A5C'^ secondary chemical 
shifts of RPELl. The positions of helices al and a2 are indicated in each 
panel. 
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Figure 5 | A8C" - ASC* secondary chemical shifts of the (a) L94P and (b) 
L105P mutant of RPELl. The positions of helices al and a2 are indicated 
in each panel. The positions of the mutations are indicated by arrows. 

selection model and the induced fit modeP^. In both models, IDPs are 
largely unstructured before binding to their target molecules. On the 
other hand, a number of studies have shown that the disordered state 
ensemble contains a significant amount of native-like secondary 
structure^ However, the significance of the native-like secondary 
structure in the disorder-to-order transition is not fully understood. 

In the present study, we investigated the conformational prop- 
erties of RPELl -3 in the free state using NMR and CD spectroscopy. 
The isolated RPEL motifs are largely disordered in solution, while 
RPELs acquire the tertiary structure upon binding to the monomeric 
G-actin^°. According to the crystal structures of RPELl and RPEL2 in 
complex with G-actin, the RPEL motif adopts two a-helices and 
binds to the hydrophobic cleft and the hydrophobic ledge of G-actin. 
The essential residues for actin binding are completely conserved 
between RPEL2 and RPELS, while the essential residues are partially 
different between RPELl and RPEL2: Ilel40 and Leul55 of RPEL2 
are substituted with Leu and Met in RPELl, although Leul36, 
Lysl39, Argl43, Leul49, and Ilel54 of RPEL2 are conserved in 
RPELP^ 

Our experimental data indicate that free RPELs contain a-helical 
structures. The a-helical structure is partially and transiently formed 
in the regions where the helices al and a2 are induced upon binding 
to G-actin^°. Generally, the structural disorder of IDP in solution 
raises the entropic penalty when IDP undergoes the disorder-to- 
order transition upon binding to its target molecule. The preformed 
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Figure 6 | A6C« - ASC^ secondary chemical shifts of (a) RPEL2 and (b) 
RPEL3. The positions of helices al and a2 are indicated in each panel. 

ot-helices of IDP may decrease the entropic penalty of the disorder- 
to-order transition^\ 

Mouilleron et al have shown that RPELl and RPEL2 bind to G- 
actin tightly, with dissociation constants {K^) of 1.0 and 1.9 |aM, 
respectively^^. They also showed that RPEL3 binds weakly to G-actin 
with a iCd value of 28.9 |aM, although the essential residues for the 
interaction with G-actin are conserved in RPEL3. From these 
values, the Gibbs free energy changes (AG°) of binding are 
-8.2 kcal/mol, -7.8 kcal/mol, and -6.2 kcal/mol for RPELl, 
RPEL2, and RPEL3, respectively. The helical propensity is higher 
in the order of RPELl, RPEL2, and RPEL3. These results suggest 
that the amount of preformed structure may correlate with the 
AG° value of binding. It is conceivable that the binding strength 
between IDP and the target molecule is modulated by the preformed 
structural elements in free IDP^\ A previous study suggests that the 
preformed secondary structure in IDP is an important determinant 
for the interaction between IDP and its target molecule^^. 
lesmantavicius et al. demonstrated that the helical propensity in 
the activator domain of the activator for thyroid and retinoid recep- 
tors ( ACTR) modulates its binding to the nuclear coactivator binding 
domain (NCBD) of the CREB binding protein, both in terms of 
association and dissociation, which results in the increased affinity 
between ACTR and NCBD'l 

Proline mutagenesis shows that the transient a-helices in RPELl 
are locally formed without helix-helix interactions. In contrast to our 
observation, long-range helix-helix interactions are formed in the 
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unfolded state of other proteins such as ACTR^^, acyl coenzyme A 
binding protein^^, and hepatitis C virus protein NS5A^^. A key dif- 
ference seems to be that these other proteins have amphipathic heh- 
ces with a clear hydrophobic side that is likely to form helix-helix 
interactions. The difference may also result from the length of a- 
helices. The a-helices of RPEL are shorter than those of the other 
unfolded proteins. 

The RPEL motif of MKLl is an ensemble of conformations ran- 
ging from random coils to native-like a-helices. Our results suggest 
that the amount of preformed structure may correlate with the bind- 
ing strength between IDF and the target molecule. In order to fully 
understand the mechanism of the disorder-to-order transition, it will 
be useful to investigate the binding kinetics of RPEL with varying 
heUcal propensity to G-actin^^. 

Methods 

Protein expression and purification. MKL1(85-116), MKLl (129- 160), and 
MKLl (173-204) were expressed as GST-fusion proteins^*^. The DNAs encoding 
MKL1(85-116), MKL1(129-160), and MKLl(173-204) were inserted into a pOP3C 
plasmid. Sequencing of the inserted DNA was performed on an ABI PRISM 3130 
Genetic Analyzer (Applied Biosystems). The GST-fusion proteins were expressed in 
C41(DE3)RIPL cells harboring the pOP3C plasmid. The cells were grown in M9 
minimal medium supplemented with ^^NH4C1 and ^^C-glucose. Protein expression 
was induced by the addition of isopropyl-P-D-thiogalactopyranoside at a final 
concentration of 0.3 mM. After 5-h cultivation, the cells were collected by 
centrifugation at 3765 g for 15 min at 4°C. 

The C41(DE3)RIPL cells were resuspended in buffer A (20 mM Tris-HCl, 
100 mM NaCl, and 1 mM dithiothreitol at pH 7.5) and lysed by sonication on ice. 
The proteins were detected in a soluble fraction after centrifugation at 9400 g for 
60 min at 4°C. The GST-fusion proteins were purified with a glutathione Sepharose 
4B column (GE Healthcare Bio-Sciences) equilibrated with buffer A. The purified 
GST-fusion protein was digested on a column with 3C protease for 15 h at 4°C to 
remove the GST region. The flow-through containing RPELs was collected by adding 
buffer A to the column, and RPELs were further purified by high-performance liquid 
chromatography with a COSMOSIL C-18 AR-II column (Nacalai Tesque). The 
solutions containing RPELs were collected and dialyzed against buffer B (10 mM 
sodium phosphate and 50 mM NaCl at pH 7.0). The purified RPELs have an N- 
terminal extension (GPHM) derived from the pOP3C plasmid. Matrix- assisted laser 
desorption ionization time- of- flight mass analysis (MALDl-TOF MS) was used to 
confirm the molecular weights of the RPELs. MALDl-TOF MS was performed on a 
Bruker Daltonics Autoflex-Tl mass spectrometer. 

NMR spectroscopy. For NMR experiments, the proteins in buffer B were 
concentrated to 0.12-0.50 mM and D2O was added to a final concentration of 7%. 
The NMR samples also contained 1 mM NaNs and 20 [lM 2,2-dimethyl-2- 
silapentane- 5 -sulfonate sodium salt. All NMR experiments were performed on a 
Bruker Avance 800 MHz spectrometer equipped with a cryoprobe. ^H^, ^^N, ^^C', 
^^C", and ^^C^ resonances were assigned using standard three-dimensional triple 
resonance experiments including CBCANH^^ CBCA(CO)NH3^ HNCO'^ 
HN(CA)CO'^ HNCA3^ and HN(CO)CA^^ The assignments have been deposited in 
the BioMagResBank under BMRB accession numbers 11564, 11565, 11566, 11567 
and 11568. The ^H chemical shifts were directly referenced to the resonance of 2,2- 
dimethyl-2-silapentane-5-sulfonate sodium salt, while the ^^C and ^^N chemical shifts 
were indirectly referenced with the absolute frequency ratios^". NMR data were 
processed with NMRPipe*^ and analyzed with NMRView*^. 

The random coil chemical shifts of Kjaergaard et al.^^ were used to calculate the ^^C" 
secondary shifts and the ^^C^ secondary shifts. The SSP scores were calculated with the 
random coil chemical shifts^*^ and the average secondary shifts for the fully formed 
secondary structure^^ as described previously^^ An averaging window of five residues 
was applied in the SSP analysis as described previously^^. The SSP scores were cal- 
culated using calibrated ^^C" and ^^C*^ chemical shifts as inputs. 

CD spectroscopy. CD spectra were measured using a J- 805 spectropolarimeter 
(JASCO) at 25°C. Sample solutions contained 0.25-0.39 mM RPEL, 10 mM sodium 
phosphate, and 50 mM NaCl (pH 7.0). A quartz cell with a 0.2 mm path length was 
used for all measurements. The content of the secondary structure was estimated 
using the CONTIN method, the self- consistent method, and the variable- selection 
method^''"'*'^. The Dicroprot program*^ and the Dicroprot server''*^ were used for the 
calculation. 
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